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Abstract 

We provide a self-contained, accessible introduction to Ratner's 
Equidistribution Theorem in the special case of horocyclic flow on a 
complete hyperbolic surface of finite area. This equidistribution re- 
sult was first obtained in the early 1980s by Dani and Smillie |DS84j 
and later reappeared as an illustrative special case |Rat92j of Rat- 
ner's work [Rat91-Rat94] on the equidistribution of unipotent flows 
in homogeneous spaces. We also prove an interesting probabilistic 
result due to Breuillard: on the modular surface an arbitrary uncen- 
tered random walk on the horocycle through almost any point will fail 
to equidistribute, even though the horocycles are themselves equidis- 
tributed [BreOSj . In many aspects of this exposition we are indebted 
to Bekka and Mayer's more ambitious survey [BMOO] , Ergodic Theory 
and Topological Dynamics for Group Actions on Homogeneous Spaces. 



1 Horocycle flow on hyperbolic surfaces 

Let X be a complete hyperbolic surface, perhaps the hyperbolic plane H, 
and let X denote the unit tangent bundle T^{X) to X (and H = T^H). 
There are three flows on X which will concern us here. They are realized by 
three represented in Figure [H 
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Figure 1: Driving the cars above leads to geodesic flow, positive horocycle 
flow and negative horocycle flow respectively 



The cars all have their steering wheels locked in position: the first car 
drives straight ahead, the second one steers to the left so as to follow a path 
of geodesic curvature 1, and the third steers to the right, also following a 
path of geodesic curvature 1. All three cars have an arrow painted on the 
roof, centered at the rear axle; for the first the arrow points straight ahead, 
and for the other two it points sideways - in the direction towards which the 
car is steering for the second car and in the opposite direction for the third. 

The flows at time t G M starting at a point x = (x, ^) G X are defined as 
follows: 



1. The geodesic flow, put the first car on X with the arrow pointing in 
the direction of ^, and drive a distance t. The point of arrival, with the 
arrow on the car at that point, will be denoted ^g{t); 

2. The positive horocyclic flow: put the second car on X with the arrow 
pointing in the direction of ^, and drive a distance t. The point of 
arrival, with the arrow on the car at that point, will be denoted x'u_|_(t); 

3. The negative horocyclic flow: put the third car on X with the arrow 
pointing in the direction of ^, and drive a distance t. The point of 
arrival, with the arrow on the car at that point, will be denoted x'u_(t). 

We will see when we translate to matrices why it is convenient to write 
the flows as right actions. 

The trajectories followed by these cars are represented in Figure [2j 
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Figure 2: In the upper half-plane model of the hyperbolic plane, the geodesic 
passing through {x, ^ ) is the semicircle perpendicular to the real axis and 
tangent at x to ^. One should remember that it is not a curve in H, but 
rather a curve in T-^iM) and carries its velocity vector with it. From the 
point {x,^), the positive horocycle is the circle tangent to the real axis at 
the endpoint of the geodesic above and perpendicular to { at x, whereas the 
negative horocycle flow is the circle tangent to the real axis at the origin of 
the geodesic, and still perpendicular to ^ at x. We have drawn our tinkertoys 
driving along them. 
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2 Translation to Matrices 

In less picturesque language (more formal, not more accurate) , you can iden- 
tify X with r \ PSL2 M for some Fuchsian group T. 

1. The geodesic flow of the point represented by g & PSL2 M is 



e 2 



2. The positive horocyclic flow of the point represented hy g E PSL2M is 

1 t 



t ^ g 



1 



3. The negative horocyclic flow of the point represented hy g e PSL2 M is 

1 



1 1-^ g 



t 1 



The standard left action of PSL2 M on H, which lifts by the derivative to 
a left action on H is given by 



a b 
c d 



z — liftmg to 

cz + a 



a b 
c d 



az + b 



cz + d^ {cz + dy 



We can then identify PSL2 IR to H by choosing Xq = (i, i) E H and setting 
$ : PSL2 R ^ H to be 



a b 
c d 



a b 



ai + b 



c d j'^ \ci + d' {d + df 



Since 



$(7^) = (7A) ■ xo = 7 • • xo) = 7 ■ HA) 

we see that $ induces a diffeomorphism $r : F \ PSL2 M — > F \ X. 

The left action above does not induce an action of PSL2 R on F \ X, but 
there is an action on the right given by 
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For 7 e r we have 



$p(A) * 5 = $r(A5) = $r(7A5) = 7 • <^r{AB) = 7 • ($r(^) * 5) 
so the action is well defined on X. All three fiows are special cases, eg. write 

''-(o e"0^ -'^-{11)^ ---ill) 
and name the corresponding one-parameter subgroups 

G = {G'\teR}, U+ = {Ul\teR}, U_ = {U'_\ t eR}. 

Then 

Let us check these. By naturality we see that for all A e PSL2 M we have 
(^•xo)^(i) = ^•(xo^(i)), (A-xo)ii+(i) = A(xoii+(i)), (^•xo)^/-(t) = A-(xoii-(t)) 

and, moreover 

$(6-*) = xo^(i), ^{UX) = Xoii+(i), = xo«_(i) 

so 

= (AG*) ■ xo = A ■ (G* ■ xo) = A ■ (xo^(t)) = (A ■ xo)5(t) = 
and the argument for w+ and is identical. 

Left multiplication by G*, and also give flows on H; probably 

easier to understand than the geodesic and horocycle flows. For instance, left 
action by U\. corresponds to translating a point and vector to the right by 
t. But these actions do not commute with the action of F and hence induce 
nothing on X. 

Since PSL2 M is unimodular, it has a Haar measure, invariant under both 
left and right translation, and unique up to multiples. Since PSL2 R is not 
compact, there is no natural normalization. Denote by 00 the corresponding 
measure on H; if X = r\H is of finite volume, we will denote by cux the 
corresponding measure normalized so that a;x(X) = 1. Up to a constant 
multiple we have 

dx Ady A dO 
^= 2 ' 
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where we have written x = {z,$,) and z = x + iy, = ye^^ (the factor y 
is there to make it a unit vector): this measure is easily confirmed to be 
invariant under both left and right action of PSL2 M on H. 

Occasionally, we will need a metric and not just a measure on PSL2 M; we 
will use the metric that corresponds under $ to the Riemannian structure 

dx^ + dy^ 2 
5 h de . 

This metric is invariant under left action of PSL2 M on H, and as such does 
induce a metric on X. It is not invariant under right action, and the flows 
M+, M_ and g do not preserve lengths. 

3 The Horocycle Flow is Ergodic 

Theorem 1 IHedSdf The positive and the negative horocycle flows are er- 
godic. 

We will show this for the positive ergodic flow. To prove Theorem 1 we 
will show that any / G -^^^(X) invariant under the horocycle flow is constant 
almost everywhere. Indeed, if the positive horocycle flow is not ergodic then 
then there is a measurable set Y G X of positive but not full measure that is 
invariant under and the characteristic function 1y provides a nonconstant 
invariant element of L^(X). 

Lemma 2 For f e L\X.), A G PSL2 M an(ix G X, let{TAf){x.) := /(x*A). 
Then the function Fj : PSL2 M ^ M defined by 

Ff{A)= [ /(x)/(x*AVxW ■.=<f,TAf> 
Jx 

is 

(a) uniformly continuous and 

(b) bi-invariant under U+, i.e., invariant under the left and the right 
action of on PSL2 M. 

Proof of Lemma [2] (a) Choose e > 0. Since the continuous functions 
with compact support are dense in L^(X), we can flnd a function g G Cc(X) 
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with 11/ — (7112 < £^/3. The fact that such a g is uniformly continuous means 
that 35 > such that 

d{A,B)<6^\\TAg-TBg\\2<^- 

So when d{A, B) < 6 we have 

||Ta/ - Tsfh < WTaI - TAgh + \\TAg - Tegh + WTeg - T^/lb < e 

We see that A i— > T^/ is a uniformly continuous map PSL2M — i^^(X) and 
(a) follows. 

(b) Biinvariance reflects the invariance of Haar measure on PSL2 M under 
left and right translation: for A G PSL2 M we have 

/ /(x)/(x*(Af/^)Vx(rfx)= / /(x)/((x*AK(t)VxW 

JX JX 

/ /(x)/(x*AVx(rfx) = F^(A); 

JX 

/ /(x)/(x*(t/^A)Vx(rfx)= / /(x*A-i)/(x*[/^Vx(c^x) 

JX JX 

/ /(x*A-i)/(xn+(tVx(t^x)= / /(x*A-i)/(xVxW 

JX JX 

/ /(x)/(x*AVx(rfx) = F^(A) 

JX 

Proof of Theorem 1. What do the biorbits of f/+ look like in PSL2M? 
Using our identification $ : PSL2 M ^ H, we can think of the biorbits as 
living in H with geometry represented in figure [31 More specifically, there 
are two kinds of biorbits. The first (exceptional) kind of biorbit consists of 
all upwards pointing vertical vectors anchored at points z = x + iy with a 
given ?/-coordinate. The union of these orbits forms a plane V C H. The 
other (generic) biorbits consist of the vectors defining horocycles of a given 
radius tangent to the x-axis: each such biorbit is diffeomorphic to a plane. 

In particular, all the biorbits are closed, and there is nothing to prevent 
the existence of nonconstant continuous functions on H that are constant 
on biorbits. But our function Fj is uniformly continuous, and that changes 
the situation: every uniformly continuous function on H that is constant on 
biorbits is constant. What we need to see is that for every e > 0, 



Ff{AUX) = 
FfKA) = 
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Figure 3: Since the left and the right action of f/+ commute, the biorbits are 
homeomorphic to M^, except the orbits on which the two actions coincide. 
Viewed in H, the orbits are the 1-parameter family of folded planes (the ham 
slices in the sandwich). The top and the bottom planes should be identified; 
they represent the orbits formed of vertical upwards pointing vectors; those 
orbits are lines, as drawn in the planes. The salient features of the figure is 
that any two points of the top plane are within e of a single biorbit (in fact 
all of those with folds sufficiently far to the left), and every biorbit comes 
arbitrarily close to a biorbit consisting of vertical upwards pointing vectors. 

• any two elements of V can be approximated to within e by a single 2- 
dimensional biorbit, and 

• that any 2-dimensional biorbit is within e of some element of V. 

These features are illustrated, but not proved, by figure [31 The proofs are 
the content of the two parts of figure HI 

Since Ff is uniformly continuous, for any e > we can find a 5 such 
that d{z,z') < 6 ^ \\Ff{z) — Ff{z')\\ < e/2. Choose any two points z = 
{x + iy,^),z' = {x' + iy', G V, and assume without loss of generality that 
y > y'. Choose r] a nonvertical vector for which d{{x + iy,^), {x + iy,ri)) < S. 
Set w" = {x" + iy', r]') is the point on the the positive horocycle through 
w = (x + iy.,!]) at "height" y', and w' = {x' + iy', t]'). The vector rj' is more 
vertical than rj, so d{{x" + iy',rj'), (x" + iy',^')) < 5. Further, w' and w'" 
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Figure 4: Left: Any two upward-pointing vertical unit vectors can be 
approximated by elements of the same biorbit. Right: Any orbit contains 
vectors arbitrarily close to upward-pointing vertical vectors. 

belong to the same biorbit. Thus 
||FKz)-F^(z')l| 

< \\Ff{z) - Ff{w)\\ + \\Ff{w) - Ff{w")\\ + \\Ff{w") - Ff{w')\\ +||Fj(w') - Ff{z')\\ 
<| + + + | = e. 

The right side of fig ?? shows that every two-dimensional biorbit contains 
vectors arbitrarily close to vertical, ie. arbitrarily close to V just taking the 
vector perpendicular to the horocycle sufficiently close to the real axis. 

Using biinvariance, we see that Ff is constant on V; evidently this con- 
stant is II/II2 = Ff{'Xo). By the Cauchy-Schwarz inequality (and the fact that 
we chose / real), we see that for all A G PSL2(M), T^/ = ±/ in L'^(X.) with 
the sign depending continuously on A. Since PSL2(M) is connected, / is a 
constant element of -Z^^(X) and the theorem follows. ■ 

4 Equidistribution of the horocycle flow 

The main result of this paper is theorem [3J 

Theorem 3 JDS8^ Let X be a complete hyperbolic surface of finite area. 
Then every horocycle on X is either periodic or equidistributed in X. 

This theorem is evidently a much stronger statement than that the horo- 
cycle flow is ergodic, or even that it is uniquely ergodic. It is not an "almost 
everywhere" statement, but rather it asserts that every horocycle is either 
periodic or equidistributed in X. 
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Note that this depends crucially on the fact that horocycles have geodesic 
curvature 1. The statement is false for geodesies (geodesic curvature 0): 
geodesies can do all sorts of things other than being periodic or equidis- 
tributed. For instance, they can spiral towards a closed geodesic, or be dense 
in a geodesic lamination, or spiral towards a geodesic lamination. Curves 
with constant geodesic curvature < 1 stay a bounded distance away from 
a geodesic, and hence can do more or less the same things as geodesies; in 
particular, they do not have to be periodic or equidistributed. 

On the other hand, curves with geodesic curvature > 1 are always peri- 
odic, hence never equidistributed. 

The horocycle flow is also distinguished from flows along curves of geodesic 
curvature less than 1 by having entropy 0. We will not define entropy here, 
but whatever definition you use it is clear that if you speed up a flow by a 
factor a > 0, the entropy will be multiplied by a. But the formula 

G{-s)U+{t)G{s) = U+{exp{-s)t) 

shows that the horocycle flow is conjugate to itself speeded up by exp(— s), 
thus its entropy must be 0. 

5 The Geometry of Flows in H 

The geodesic flow in X has stable and unstable foliations: two points xi, X2 G 
X belong to the same leaf of the stable foliation if d{'Xig{t),:s.2g{t)) is bounded 
as t +00, and they belong to the same leaf of the unstable foliation if 
(i(xi(7(t), X25'(t)) is bounded as t ^ — oo. These foliations are very easy to 
visualize in T^H, as shown in Figure O 

Note that the stable leaves are fixed by the positive horocycle flow: the 
positive horocycles are the curves orthogonal to the geodesies in a leaf. Sim- 
ilarly, the unstable manifolds are fixed by the negative geodesic flow, and the 
negative horocycles in a leaf are the curves orthogonal to the geodesies in 
that leaf. 

On the other hand, the positive horocycles are transverse to the unsta- 
ble manifolds, and positive horocycle flow does not send unstable leaves to 
unstable leaves. 

Let Sx be the unstable manifold of the geodesic flow through x. Define 
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Figure 5: The tangent vectors to geodesies ending at the same point at 
infinity form one leaf of the stable foliation for the geodesic fiow. Similarly, 
the tangent vectors to geodesies emanating from a point at infinity form a 
leaf of the unstable foliation. In X, these leaves are tangled up in some 
very complicated way (after all, most geodesies are dense, never mind their 
stable and unstable manifolds). But clearly each leaf is an immersed smooth 
surface, hence of measure in X. 

5'x(a, b) C Sy, by 

S'x(a,&) = {xM_(r)(7(s), |r| < a, \s\ < b}. 

We will refer to S'x(a, &) as a "rectangle"; it isn't really: it is a quadrilateral 
bounded by two arcs of geodesic of length 2b, and be two arcs of negative 
horocycle, of length respectively ae'^ and ae~^ (see figure ED- 

Further we define the "box" PVx(a, c) C X as the region obtained by 
fiowing along positive horocycles from S'x(a, b) until you hit 5'xm+(c)- Because 
S:x.u+{c) is actually dense in X, you have to understand the fiow as taking 
place in the universal covering space H, and then projecting the "box" to X 
(see figure E] again) . 

Each surface Sx is invariant under geodesic fiow, but the "rectangles" 
5'x(a, b) are not; instead we have 

S^{a,b)g{s) = S'xg(,)(e'a, 6). 

Moreover, the surfaces Sy_u+(s) foliate a neighborhood of the positive horo- 
cycle X * through X, and thus there is a function ax(y,t) : M ^ M for 
y G ^x (for its precise domain, see below) such that 

yM+(ax(y,t)) e S^u+it)- 

as sketched in Figure [61 
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Figure 6: The surfaces 5'x„_,_(s) foliate a neighborhood of the positive horo- 
cychc orbit of x. Thus, for every t and every y G S'x sufficiently close to x, 
there is a time Q;x(y,t) such that the positive horocycle yM_|_(R) intersects 



The function t h- > Q;x(y, t) is defined in [0, T(y)) for some T{y) that tends 
to oo as y — > X. Moreover, the function is C°° (actually real- analytic) by the 
imphcit function theorem, and ^Q;x(y,i) tends to 1 as y ^ x, so that 

hm ^ 1. 

y-*x t 

It isn't often that you can replace the implicit function theorem by an 
explicit formula, but this does occur here. 

Lemma 4Ify — x.u-{r)g{t), then 

ttx(y,s) = — r. (2) 

e*(l — rs) 

In particular, a^iy, s) is defined iny E S'x (a, b) for all s < 1/a and all h, 
and 

hm — ax(y, s) = 1, hm = 1. 

y^x ds y^x s 
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Proof. This is a matter of solving the equation 

xM_(r)5((t)u+(ax(2/, s)) = x.u+{s)u_{p)g{T), 
i.e., the matrix equation 



1 

r 1 



et/2 






-t/2 





1 a 




1 s 




1 






1 




1 




P 1 





er/2 





-r/2 



This is a system of 3 equations (because the determinants are all 1) in 3 
unknowns p, t and a. Just multiply out and check. ■ 

The central result here is the following: 

Lemma 5 There exists a constant C such that for all < 6 < 1/2, allt > 0, 
all y e >S'x(f , 5) and all < s < t we have 

c/(xM+(s),yM+(«x(y, s)) < C5. 

Note that ax(y)'S) is defined for s < t when y G S'x(f,5) and 6 < 1/2, 
since for the factor 1 — rs from the denominator of formula [21 we have r < S/t 
and s < t, so 1 - rs > 1 - 6^ = 3/4. 

Proof. The proof essentially consists of gazing at Figure [71 Almost every- 
thing in that figure comes from the fact that the geodesic fiow takes horo- 
cycles to horocycles; moreover, geodesic fiow for time t maps a segment of 
positive horocycle of length / to one of length e~*/, and a segment of negative 
horocycle of length / to one of length e*/. 

Two points X, y with y G Sy^{6/t, 6) fiow under the geodesic fiow for time 
logt to two points x' = x(7(logt) and y' = yg{logt)] note that y' G 5'x'(5, S), 
so certainly d{x', y') < 26. Then under positive horocycle fiow (for different 
times) these points fiow to points 

x" = x'm+(s5) and y" G S^"- 

By the argument in the caption of figure [TJ there exists a universal constant 
C such that d{x",-y") < 2C(i(x, y). Finally, use the geodesic fiow back, i.e., 
for time — logt, to find points 

x'" = xM+(s), y'" = yM+(ax(y, s)). 

Since backwards geodesic fiow in a single unstable manifold is contracting, 
we find d{x"', y'") < 2C5. 
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Figure 7: The skinny "rectangle" S'x(f,5) becomes under the geodesic flow 
for time logt the "square" 5'xg(iogt)(^, (5), and the box Wx{j, 5, St) becomes 
the box W:^g(iogt){S,6,6). The geometry of W:x.g{iogt){S,S,S) is standard: it 
depends only on S. In particular, the positive horocyclic flow from the bottom 
5'x(f , 5) of the box is defined since S < 1/2, hence C°°, hence Lipschitz with 
a universal constant C. 

6 Geometry of hyperbolic surfaces and cusps 

Let X be a complete hyperbolic surface. If such a surface is not compact, 
it has finitely many cusps. Every cusp c is surrounded by closed horocycles, 
and the open region bounded by the horocycle of length 2 is a neighborhood 
Nc isometric to the region 2Z\{y > 1} that is embedded in X, moreover, if 
c, c' are distinct cusps, then Nc fl Nc' = 0. If X has finite area and since 
each of the disjoint neighborhoods Nc of the cusps has area 2, there are only 
finitely many cusps. |Hub06] . 

We know everything about the standard cusp {y > 1}/2Z, in particular 
that any geodesic that enters it will leave it again unless it goes directly to 
the cusp, i.e., unless it is a vertical line. Thus the same holds for all Nc- 
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horocycles of length 2 




Figure 8: A non-compact complete hyperbolic surface is always non-compact 
in the same way: it has cusps c with disjoint standard neighborhoods 
isometric to {y > 1}/2Z, hence bounded by horocycles of length 2. Note 
that the only way a geodesic ^{t) can stay in such a neighborhood for all 
t >to is to head straight to the cusp. Each cusp has a stable manifold in X, 
and the geodesies that do not return infinitely many times to X^. — X — UcN^ 
are those that belong to one of these stable manifolds. 

If X has finite area, then the complement of these neighborhoods is com- 
pact: 

X,^X- □ 

cusps c of X 

is a compact set. Denote by Xc the corresponding part of X. The injectivity 
radius is bounded below on Xc, so there is a number 5x > such that for 
every x G Xc the box VFx(5x, ^x, ^x) is embedded in X. 

Now, suppose that X has finite measure. Then if {x, ^) e X is a point 
through which the positive horocycle is not periodic, the geodesic through 
(x,^) does not go forward to a cusp and hence must enter Xc infinitely many 
times. 

All periodic horocycles are homotopic to horocycles surrounding cusps. 
Indeed, if we apply geodesic flow to a positive horocycle, it will become 
arbitrarily short, thus will either be contained in a neighborhood of a cusp 
or in a contractible subset of X. No horocycle in such a contractible subset 
is closed, thus the horocycle is homotopic to a horocycle surrounding a cusp. 

Thus, the set Px C X of points defining periodic positive horocycles is 
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the union of the stable manifolds of the cusps (for the geodesic flow). 



Lemma 6 The set Px has measure zero in X for the measure cux- 

Proof. There are finitely many cusps, and each has a stable manifold which 
is a smooth immersed surface, certainly of 3-dimensional measure 0. ■ 

Although we have used the fact that X has finite area, the result is true 
for every complete hyperbolic surface, since such a surface can have only 
countably many cusps. 

7 A sequence of good times 

In this section we prove a result, still a bit weaker than theorem [3l though 
it does prove theorem [3] when X is compact. 

Theorem 7 Let X he a complete hyperbolic surface of finite area, X he its 
unit tangent bundle. For all x ^ Px, there then exists a sequence T„ — > cxo 
such that for any function f G Cc(X) we have 



The proof will take the remainder of this section. 

Proof. Let T„ be any increasing sequence tending to oo such that ^g{t) E Xc. 
Such a sequence exists because x ^ Px. 

Choose e > 0, and / G Cc(X); without loss of generality we may assume 
sup 1/1 = 1 and that e < 1 . 

We have already defined 6x- We need two more 6's, to be specified in 
lemmas [8] and [91 

Lemma 8 There exists Sj > such that for all t > 0, if z E Sx{Sf/t,6f) 
and < s <t, then 



Proof. This follows immediately from proposition [5] and the uniform conti- 
nuity of /. ■ 




|/(xM+(s)) - /(zu+(ax(z, s)))\ < e. 
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Lemma 9 There exists such that for all t > 0, if z E Sy^{5a/t^5Q) and 
< s < t, then 

|ax(z, s) — 1| < e. 

Proof. One could derive this from the imphcit function theorem, but we 
might as well use our explicit formula ([2]) for a. For z = ^u^{r)g{u) G 
Syi{Sa/t,6a) we have 

e"(l — rsj^ 
and since |r| < Sa/t, \u\ < 6a and s <t, 



Clearly we can choose 6a so that 

1 



e^-(l + 6a 



< e, 



- 1 



< e. 



Set 6 = inf (5x, 6f, 6a), and rj = co'x(W^x(<5, 6, e6)). 

Proposition 10 There exists a T and a set Y C X with uj^{Y) > 1 ~ rj 
such that for allT >T and all y eY we have 



^£ f{yu+{t))dt- Jjdu^ 



< e. 



Proof. By the ergodic theorem, the family of functions 



1 r 

griy) = 7^ j f{yu+{t))dt 



converges almost everywhere as T — > oo, and since horocycle flow is ergodic, 
it converges almost everywhere to fdux. (this is where we use Theorem 
d]). By Egorov's theorem, there exists a set Y of measure at least 1 — rj such 
that the Qt converge uniformly on Y; omitting a set of measure from Y, 
the family qt converges uniformly to fduj^ on Y . This is the assertion of 
Proposition [TOl ■ 



We can now choose 
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1. an no such that > f ior n > Uo- Then for n > no we can select: 

2. a sequence Yn £ Y fl Wy^{6/Tn, S, eSTn). Indeed, we have 

uj^{W4S/Tn,S,eSTn)) = v, 

since it is the inverse image of Wxu+(r„)((^, ^5) by the geodesic flow at 
time T„. We have 

Wx(Vrx«+(T„)(5,5,e5)) = 7] 

since x-u+(T„) G Xc and 5 < 5x • Geodesic flow for a fixed time is 
a measure-preserving diffeomorphism, so Wx{S /Tn, S, eSTn) must inter- 
sect Y which has volume > 1 — r). 

3. sequences z„ e S:^{d/Tn,S) and e'„ < e such that z„-u_|_(e^5T„) = y„. 
This is just what it means to say y„ G IVxlf^/^n, e<52n)- 

The organizing principle is now to write for n > Uq 

^ r f{^u+{t))dt- I f{w)u;x{dw) < 

■^n Jo ^X 

1 f^" 1 Z""^" / \ 



+ 



— f{z^u+{s))ds- — /(z„'u+(s))(i, 

-Ln Jo Jo 



+ 



/(y„M+(s))(is 



1 /"^" 1 
— / /(z„'u+(s))(is - - 

Jo Jo 

^ [ f{ynU+{s))ds- / /(w)wx(c?w) 
-'n JO -/X 



+ 



To get from the second summand on the right to the third, we use the 
change of variables formula, setting s — Q;x(z„,i)): 

f\ZnU+{a^{Zn,t))ja'^{Zn,t)dt= I f{ZnU+{s))ds. 



Each of the five terms above needs to be bounded in terms of e. 
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1. Since S < Sf, we have |/(x'u_|_(t)) — /(z„'u+(«x(zri, t)))| < e, so 



T 

■J- 7 



n Jo 



Tn 



< e. 



2. Since 5 < we have 



T 



T 



n Jo 



1 Z"^" 

< — / sup I/I |1 -<(z„,t)|c?t < e. 



3. From S < Sa, so \a' — 1| < e, we get that (1 — e)T!„ < ax(z„,T„) < 
(1 + e)T„ and hence 



TfT / /(z„M+(s))rfs - — 

-Ln Jo -'■n Jo 



f{ZnU+{s))ds 



< e. 



4. The points z„ and are on the same positive horocycle, a distance 
e'^Tn apart for some < e. This leads to 



f{ZnU+{s))ds- — f{ynU+{s))ds 
J-n JO ^0 

— / /(z„M+(s))rfs - / /(z„M+(s))rfs 



5. Since G Y and T„ > T, we have 



/ fiynU+{s))ds- I /(w)^x(c?w 



< e. 



This ends the proof of theorem [3 □ 



19 



8 Proving equidistribution 



The Tn are chosen to be a sequence of times tending to infinity such that 
^giTn) G Xc. Thus if X is compact, the sequence T„ is an arbitrary sequence 
tending to infinity, and so equidistribution is proved in that case. Moreover, 
clearly theorem [7] shows that all non-periodic horocycles are dense in X. But 
it doesn't quite prove that they are equidistributed when X is not compact; 
perhaps a horocycle could spend an undue amount of time near some cusp, 
and we could choose a different sequence of times also tending to infinity 
which would emphasize the values of / near that cusp. In fact, we will 
see in Section 9 that something like that does happen for random walks on 
horocycles. 

We will now show that this does not happen for the horocycle flow itself. 

Proposition 11 Let v he a probability measure on X invariant under the 
positive horocycle flow and such that z^(Px) = . Then v = cjx- 

Proof. Without loss of generality, we can assume that u is ergodic for the 
positive horocycle flow since any invariant probability measure is a direct 
integral of ergodic invariant probability measures with //^(Px) = 0, so 
uniqueness for such invariant ergodic measures implies uniqueness for all 
such invariant measures. Choose / G Cc(X), and let x G X be a typical 
point for u, i.e., a point of X — Px such that 

t 



[ fdv = lim - / /(xM+(s))(is. 
Jx t Jo 



By the ergodic theorem, this is true of //-almost every point, so such points 
X certainly exist. Such a point is one for which the horocycle flow is not 
periodic, so theorem [7| asserts that there exists a sequence t„ — > cx) such that 

/ fdu= lim i / /(xM+(s))rfs = lim / /(xM+(s))rfs = / /c/t^x- 

Jx t Jq n^co tn Jo Jx 

Since this equality is true for every / G Cc(X), we have z/ = cjx- ■ 

Now suppose that for some x G X — Px and some / G Cc(X), we do not 
have 

1 /•* 

/citux = hm - / /(xu+(s))(is. 
x t Jo 
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We can consider the set of probability measures Vt defined by 
/ /^^t = 7 / f{xu+{s))ds. 

J X J 

On a non-compact space, the Riesz representation theorem says that set of 
Borel measures is the dual of the Banach space Co(X) of continuous functions 
vanishing at oo with the sup norm. The collection of probability measures Vt 
is a subset of the unit ball, and the unit ball is compact for the weak topology. 
So if \mit->oo 7^ ^x, there exists a measure v ^ ujy^ and a sequence tj ^ oo 
such that 

lim Vti = ^ 

in the weak topology. 

Clearly u is invariant under the horocycle flow and ergodic. So it might 
seem that tux 7^ contradicts proposition [111 There is a difficulty with this 
argument when X is not compact. In that case the probability measures do 
not form a closed subset of the unit ball of Co(X)*; consider for instance the 
measures 6{x — n) on M; as n 00 they tend to in the weak topology. 
Technically, the problem is that we can't evaluate measures on the continuous 
function 1, since this function doesn't vanish at infinity. 

We need to show that i/ is a probability measure. This follows from 
proposition [T^ below. For p < 2, let X'' C X be the compact part of X in 
which all periodic horocycles have length > p. So Xc = X^ 



Proposition 12 . For any e > 0, there exists p > such that for all 
X G X — Px we have 

1 [' 

lim - / lxp(x'U+(s))(is > 1 — e. 

t^oo t Jq 

Proof. If c is a cusp of X, let A^^ be the neighborhood of c bounded by 
the horocycle of length p. Recall from our discussion of the geometry of 
hyperbohc surfaces, that A'^^ is isometric to a standard object: the part 
of (2Z)\H where y > 1. Set 7 to be the Moebius transformation 7(z) = 
z/{z + 1); the standard neighborhood is then isometric to the part of (7)\H 
where + (?/ — 1)^ < 1. Moreover, any horocycle that doesn't tend to the 
cusp is equivalent by a change of variable commuting with 7 to a horizontal 
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Figure 9: In if a neighborhhod of a cusp bounded by a horocycle corresponds 
to a disc tangent to the x-axis. In the figure on the left, we have represented 
the cusp by {■y)\H] without loss of generality we may set 7(z) = z/{l + z). 
Then the disc of radius 1 centered at i corresponds to the neighborhood of 
the cusp bounded by the horocycle of length 2, and the disc of radius p/2 
centered at ip/2 corresponds to the neighborhood bounded by a horocycle 
of length p. A horocycle that enters this neighborhood but does not go to 
the cusp can be, without loss of generality, represented by a line of equation 
y = e] it goes deeper and deeper into the cusp as e — > 0. The ratio of times 
spent in A^'' to the time spent in A^^ — A^'' does not become large as the 
horocycle goes deeper in the cusp, but tends to a ratio depending only on 
p, which tends to as p tends to 0. As horocycles go deeper and deeper 
in the cusp, they spiral more and more tightly in A^^ — A^^ and still spend 
approximately the same fraction of time in A^^ — A^'' as in A^^. 



line. Of course lengths on such a horizontal line y = e depend on e, but ratios 
of lengths are the same as ratios of euclidean lengths. 

A careful look at figure [H] shows that if a horocycle starts in Xc, goes deep 
in the cusp, and comes out again, then the ratio of time spent in A^'' to time 
spent in A^^ — A^'' is 

^^^^ - ^ +0{e). (3) 



V2^^- ^^^^ V2-v^ 

Any non-periodic horocycle will eventually enter X^; by taking p sufficiently 
small, we can assure that afterwards it will spend a proportion of its time 
< e outside of X''. Proposition [T2] follows. ■ 
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Consider the measures 



T 



i'^,T / fixu+{t)dt). 







Proposition 13 The accumulation set of {v^^t^T > 0} consists entirely of 
probability measures. 

Proof. Every accumulation point fi of the ^'x,t in Cq(X.)* is a measure, and 
the only thing to show is that /i(X) = 1. Clearly /^(X) < 1, since for any 
/ G Cq{X) and any x, T we have 



fi^u+{t))dt< 



To see that /i(X) > 1, take e > and p as in proposition [T^ We can 
then find a function / G Co(X) which coincides with Ixp on X^ and satisfies 
< / < 1 everywhere. Then 

/i(X) = sup (4) 

3eCo(x) ||5'l|oo 

> / /rf/i>liminfi f f{^u+{t))dt (5) 

1 

> liminf - / lxp(xM+(t))rft > 1 - e. (6) 



There is one last thing to check. 

Proposition 14 A measure fi in the limit set of {vy^^TiT > 0} with x ^ Px 
satisfies yu(Px) = 0. 

Proof. Suppose /i(Px) > 0, set e = /i(Px)/3 and use proposition [T2] to find 
a corresponding p. Find a compact subset Q C Px with /i(Q) > |/i(Px), 
and find a time T such that 

Q^(r) n x^ = 0. 

This is possible because Px consists of points in the stable manifolds of the 
cusps, so each point can be moved off X^, and since Q is compact it will 
leave X'' under the geodesic fiow at some time T. 
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Let U be a neighborhood of Q such that \Jg(T) fl = 0. For this 
neighborhood U of Q, as for any neighborhood, there exists a sequence of 
times Tn oo such that 

A{tG [0,r^] I x»+(t) €U} ^ 1 

where A is hnear measure. Then the horocycle t ^ :Kg(T)u^{t) must spend 
the same proportion of its time in \Jg{T), hence outside X''. But every non- 
periodic horocycle spends at least a proportion 1 — /i(Px)/3 in X — X'', and 
this is a contradiction. ■ 



9 Horocycle flow on the modular surface 

Let r be the 2-congruence subgroup of SL2(M), so that X = r\ SL2M is the 
unit tangent bundle over X = T\H, which is the 3-times punctured sphere. 
Denote by ttx : H ^ X and ttx : X ^ X respectively the projections from 
H = PSL(2, M) onto X = r\ PSL(2, M) and from X onto X. 

Lemma 15 The hyperbolic surface X has area 27t, and the subset X — X^ 
has area 3p. 

We leave the proof of this lemma to the reader. 

It follows from lemma [TS] that for every Xq ^ Px there exists for every 
sequence p„ 0, for every e > and for every n sufficiently large, a time 

such that 7rx(xoM+(T„)) e X - X^". 

To use this result, we need to understand the region in H corresponding 
to XP. 

Lemma 16 The inverse image in H of X — X^ is the union of the horodisc 
Imz > 2/p, and the union, for all rational numbers p/q of the discs of radius 
p/{4:q^) tangent to the real axis at p/q. 

Choose a G M — Q, and consider the horocycle in X which is the image of 
the horocycle zoM+(t) in H, where zq = a + 2z, i.e., the image of the horocycle 



24 



represented by the circle of radius 1 tangent to the real axis at the irrational 
number a. A straightforward computation shows that 



ZoM+(T) 



IT 



a — 



T2 + 1 



+ i 



T2 + 1' 



Set Pn = 1/n. This horocycle is not periodic, so it must enter X — X''" 
at a sequence of times T„ < (1 + e) (^)- Interpreting the cusps as rational 
numbers, this means that there exists an infinite sequence of rational numbers 
Pn/Qn and times T„ < (1 + e)^ such that 



a 



Pn 



2T„ 



< 



T 



n + l- 2nql 



<(l + 6l 



7m 



TT 



3g2 



This is of course nothing to boast about. It has been known for over 100 
years that for every irrational number a, there exist infinitely many coprime 
numbers numbers p„, g„ such that 



a 



Pv 



< 



and that l/-\/5 is the smallest number for which this is true |Kin64j . Our 
analysis only gives the constant 7r/3, too large by a factor of more than 2. 

One reason to take an interest in this result despite its weakness is that 
Ratner's theorem has many generalizations to situations where methods lead- 
ing to the sharp results about diophantine approximations of irrational num- 
bers are not available. In all settings, Ratner's theorem has "diophantine" 
consequences. 

Clearly we cannot do better than improve the constant for all horocycles. 
But we can use the theory of diophantine approximations to improve the 
results above for almost every horocycle. In particular we can apply the 
following theorem. 

Theorem 17 {Kind 4^ If g{x) : R*5_ ^ M zs a function such that g{x)/x 
is increasing, then for almost every a, there exist infinitely many coprime 
integers p, q such that \a — - \ < 



if and only if the series 

oo ^ 



diverges. 
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Let us see what this says about horocycles; we will specialize to the 
case where g{n) = nlog{n). For almost every xq, the horocycle xof/+ lifts 
to a horocycle zqU+ in H tangent to the x-axis at an irrational number a 
belonging to the set of full measure from theorem [T71 Changing the time 
parameterization by a constant, we may assume that zq = (a + 2iR, —2iR) 
since this is in any case the worst point on the horocycle zof/+. 



Figure 10: We can lift the horocycle xot/+ to H; without loss of generality 
we may assume that the cusp c is at 0, and that the stabilizer of the cusp is 
generated hy z z/{z+l). In that case, the horocycle of length 2 lifts to the 
circle of radius 1 centered at i, and the horocycle of length L lifts to the circle 
of radius L/2 centered at Li/ 2. We may take Xq = 7rx(xo) to be anywhere on 
this horocycle; it will be convenient to place it at "^^2^4'^^ - In that case, one 

fundamental domain on the horocycle goes from "^^2^4''^^ to Our 
modified horocycle will join xq to the point p > on the real axis. It is 
much easier to estimate lengths on this horocycle if we send p to infinity by 
a parabolic transformation that fixes 0, and hence all the horocycles tangent 
to the real axis at 0. If we perform this parabolic transformation, the point 
Xq moves to a point yo on its horocycle which is approximately —p + ip'^/L, 
and the horocycle is a horizontal line, approximately the line y = p'^/L. 

Let Pn/Qn be one of the good approximations to a guaranteed by theorem 
[T71 Let pn be the radius of the negative horocycle surrounding the cusp 
corresponding to Pn/Qn when the point zon+(T„) is on the vertical line x = 
Pn/q-n- Further let us write 
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Then we have T„ = — and rin = and by lemma ?? it follows that 

l/{ql\ogqn) 2 



Moreoever rjn = R — \/\R^ — in) ~ %^ from which we can derive that 



C2 

1 ^ 



We have proved the following: 

Theorem 18 On the modular surface, for every e > and for almost every 
horocycle ^oU+, there exists a sequence p„ — and times T„ < (1 + e) — i 

log ^ 

such that XoM+(T„) G X - X''". 

The theorem means that almost every nonperiodic horocycle enters X — 
X''" much earlier than is implied for every nonperiodic horocycle by equidis- 
tribution. 

This leads to a surprising result due to Breuillard |Bre05j : although non- 
periodic horocycles are equidistributed, any uncentered random walk on the 
set of non-periodic horocycles almost surely is not. 

Theorem 19 Let p he a probability measure on M with finite expectation and 
variance: 







/oo /"OO 
tjj,{dt) < oo and = {t — aYjj,{dt) < oo. 
■oo J —oo 



Denote by /x*™ the mth convolution of fi with itself. If b > 0, there exists 
a function f G Cc(X) with J^f{'x)ux{d:K) = 1 such that for almost every 
xo G X t(;e have 

/oo 
/(xoM+(t))/X^'"(rft) = 0. 

Proof. Let a G M be an arbitrary element of the full Lebesgue measure 
set guaranteed by theorem [T71 Suppose that the horocycle zof/+ through 
zq = {a + 2iR, —2iR) in H projects to a horocycle in X containing xq. 
Now choose an approximating sequence of coprime qn) G to a as 
guaranteed by theorem [17] above, and let T„ be the associated sequence of 
times. The measure fi*"^ is approximately the Gaussian of mean ma and 
standard deviation x/mb. Let us choose an m such that: 
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1 . ma = Tn for one of the T„ given in theorem | 

2. the standard deviation of a{fi*''^) ~ by/m is much smaller than l/^/p^ 

This first condition can obviously be satisfied, and the second is also 
straightforward since 




bx/m b\ — < 



a Va\ pn log q. 



and a{fi*"^) will be much smaller than as soon as qn is large enough. 

Recall that it takes time of the order 1/^^ ioT a horocycle to get from X — 
Xf to X"^. Thus for the m found above, there are many, say c(m), standard 
deviations of /i**" around the mean am where xoM+([am— c(m), am+c{m)]) G 
X — X^. It follows that if / G Cc(X) satisfies /(x)co'x(c?x) = 1 but / has 
its support in X^, we have 

/oo 
/(xoM+(t))/i*"^(rft) = 0. 

This proves that the random walk is not equidistributed. ■ 
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